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Abstract. In this paper we propose a deterministic algorithm for ap- 
proximate counting the fc-colourings of a graph G. Wc consider two cate- 
gories of graphs. The first one is the sparse random graphs Gn^d/n, where 
each edge is chosen independently with probability d/n and d is fixed. 
The second one is a family we call locally a-dense graphs of bounded 
maximum degree A. A graph G = {V, E) in this family has following 
property: For all {u, v} £ E the number of vertices which are adjacent 
to V but not adjacent to u are at most (1 — a)^, where a £ [0, 1] is a 
parameter of the model. 

Our algorithm computes in polynomial time a (1 ± n~''^'^^)-approximar- 
tion of the logarithm of the number of A;-colourings of G„,d/n for fc > 2.1d 
with high probability. Also, for G a locally a-densc graph of bounded 
maximum degree A it computes in polynomial time a (1 ± (log n)^^^'^-*)- 
approximation of the logarithm of the number of k-colourings, for k > 
(2 — a)A. Restricting the treewidth of the neighbourhoods in G we get 
improved approximation. 

Our algorithm is related to the algorithms of A. Bandyopadhyay et al. in 
SODA '06, and A. Montanari et al. in SODA '06. However, we use corre- 
lation decay properties for the Gibbs distribution in a completely different 
manner. I.e. given the graph G = {V,E), we alter the graph structure 
in some specific region A C. V (by deleting edges between vertices of 
A) and then we show that the cfi'ect of this change on the marginals of 
Gibbs distribution, diminishes as we move away from A. Our approach is 
novel and suggests a new context for the study of deterministic counting 
algorithms. 



1 Introduction 

For a graph G = {V, E) and a positive integer fc, a proper fc-colouring is an 
assignment a : V ^ [k] (we use [k] to denote {!,..., /c}), such that adjacent 
vertices receive diflFerent members of [k], i.e. different colours. Here we focus on 
the problem of counting the fc-colourings of G. 

Typically, the set of colourings is exponentially large and exhaustive search 
is computationally prohibited. In [21] L. Valiant introduced the notion of #P- 
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hardness and showed that exact counting of A:-colourings is #P-complete. Mod- 
ulo a complexity theoretic conjecture the problems in #P do not admit a poly- 
nomial time algorithm. Research focuses on approximate counting algorithms. 

Usually, a counting problem is reduced to computing certain marginal prob- 
abilities. Typically we estimate these marginals by using some sampling algo- 
rithm. The most powerful method for sampling is the Markov Chain Monte 
Carlo (MCMC). There the main technical challenge is to establish that the un- 
derlying Markov chain mixes in polynomial time (see [15]). The MCMC method 
gives probabilistic approximation guarantees. 

Recently, new approaches were proposed for deterministic counting algo- 
rithms by [3] and [24]. This new approach links the correlation decay to comput- 
ing efficiently marginals of Gibbs distributions. The correlation decay concept, 
also called spatial correlation decay or spatial mixing is related to notions such 
as uniqueness of Gibbs measures in infinite graphs (see [10]). 

The two algorithms in [3, 24] suggest two different approaches for comput- 
ing Gibbs marginals. In [3] the computation of marginals is done w.r.t. the 
graph-theoretic structure of the graph. In [24] the problem is reduced, first, 
to computing Gibbs marginals on some appropriately defined tree where some 
boundary conditions are imposed. These two different approaches require two 
different kinds of spatial correlation decay conditions. 

The second approach gives better polynomial time approximations, however, 
it requires stronger correlation decay conditions. Attempts to establish such 
strong conditions were successful for "two-spin" cases, e.g. independent sets, 
matchings, e.t.c. (sec [24,4,18]). For the "multi-spin" cases, such as colourings, 
things seem harder. The best algorithm of this category for counting fc-colourings 
requires k > 2. 8 A and girth at least 4 (see [9]) ^. 

In this paper we propose a deterministic algorithm for approximate counting 
the fc-colourings of a graph G. The algorithm computes Gibbs marginals similarly 
to [3, 17], i.e. it fits to the first of the two approaches described above. However, 
our approach is novel, as we use spatial correlation decay in a completely different 
manner than that in [3, 17]. The spatial correlation conditions we require seem 
to be weaker than those in [3, 17]. 

In most of the cases considered, here, our algorithm outperforms the corre- 
sponding MCMC algorithms in terms of minimum colours required for polynomial- 
time execution. 

Even though our approach is general and applies to any graph, here, we 
consider two, very interesting, cases: The first one is the sparse random graph 
Gn4/n-> where each edge is chosen independently with probability d/n and d is 
fixed. The second is a family of graphs that we call locally a-dense graphs. 

Definition 1.1 (locally dense). A graph G = {V,E) is locally a-dense if the 
following holds: For all {u,v} G E, v has at most (1 — a) A neighbours which 
are not adjacent to u, where A is the maximum degree of G and a G [0, 1] is a 
parameter of the model. 

^ The best MCMC algorithm for sampling fe-colourings of general graph, mixes in 
polynomial time for k > 11/6 A (see [22]). 
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Note that the above family includes some G = {V, E) having the following prop- 
erty: for all V &V, the minimum degree of the subgraph induced by the neigh- 
bours of V is 0{A), where A is the maximum degree in G. However, the family of 
locally a-dense graphs is not restricted only to this kind of graphs, it is richer. In 
Section A. 3 we show that this family allows graphs even with triangle free neigh- 
bourhoods. In this case the vertices of the triangle free neighbourhood should 
have sufficiently small degrees, i.e. (1 — a)A. 

The motivation for studying the problem of counting fc-colourings of graphs 
in these two categories is the following: As far as Gn,d/n is concerned, the chal- 
lenging task is the actual computation of Gibbs marginals for the fc-colourings. 
Generally, the efficient computation of Gibbs marginals for the solutions of ran- 
dom constraint satisfaction problems has been central in Statistical Physics as 
well as in Computer Science, for the so called on average case analysis of the 
algorithms. Moreover, accurate computation of such marginals has been related 
to the empirical success of heuristics such as Survey Propagation and Belief 
Propagation in finding solution of instances of random constraint satisfaction 
problems (see [16]). 

As far as locally dense graphs arc regarded, we note that in the MCMC 
literature for sampling colourings, there is a large number of results showing that 
we can have polynomial mixing with k < 11/6 A for special cases of graphs. In all 
these results, the common graph property required is that the neighbourhoods 
are fairly sparse, e.g. sufficiently large girth, planarity, e.t.c. (see [II, 13, 8]). Here 
we show that we can count colourings below the barrier of II/6Zi, for bounded 
degree graphs, even when there exist very "dense" neighbourhoods. 

1.1 Results 

Let Z{G, k) denote the number of /c-colourings of the graph G. Z{G, k) is also 
called partition function. Our algorithm computes an approximation for the log- 
partition function log Z{G, k). 

Definition 1.2. The value ^{G,k) is defined to he e- approximation of the log- 
partition function log Z{G,k) if 

(I-e)l^^^<^(G,.)<(l + e)l^^^. 

n n 

First we apply our algorithm on and we show the following theorem. 

Theorem 1.1. Taking sufficiently large d and k > 2. Id, then, with probabil- 
ity at least 1 — n~'^^ our counting algorithm computes a n~'^^ -approximation of 
logZ(G'„ (i/„, fc), in time 0(n'^), with e\, ei and c positive real numbers which 

depend on k. 

Furthermore, we will show that the arithmetic value that is computed by the 
algorithm is with high probability^ (w.h.p.) n(log (fc(l — ± o(l)). This 

^ With probability that tends to one as n tends to infinity. 
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result can, also, be derived by the second moment method, see the work of 
Achlioptas and Naor in [1] . Note that this result here is derived algorithmically, 
i.e. wc provide a procedure which actually counts the number of colourings of 
Gn,d/n- Finally, we show that there is a set of graphs Sd with Pr[Gn,d/n G •S'd] = 
1 — n~°'^ such that for every G G Sd we can verify in polynomial time whether 
log Z{G, k) is equal to r?,(log (fc(l - l/kY/"^) ± o(l)) or not, when k > 2.1d. 
For the locally dense graphs wc get the following theorem. 

Theorem 1.2. For G = {V,E) a locally a-dense graph of maximum degree A, 
a fixed constant^, and for k > {2 — a)A the counting algorithm computes a 
{log n)~^^^^ -approximation oflog{Z{G,k)) in polynomial time. 

For the non too restrictive case where a > 1/6 the lower bound for k is less than 
11/6Z\, i.e. less colours than what the best MCMC requires for polynomial-time 
sampling for this family of graphs. Restricting further the locally dense graph 
we get an even more accurate counting. 

Corollary 1.1. For G = {V,E) a locally a-dense graph with the additional 
property that the 0{logn) neighbourhood of each v & V has bounded treewidth, 
the algorithm computes a n~^^^^ -approximation oflog{Z{G,k)) in polynomial- 
time for k > {2 — a)A. 

It is worth mentioning the least number of colours k that give polynomial 
mixing for the MCMC sampling for the two families of graphs we consider here. 
For Gn d/m k is a constant which depends on d with high probability (sec [19]). 
For the second family k is the same as the general boimd for MCMC sampling 
colourings of a graph with maximum degree A, i.e. k > ll/6Zi (see [22]). 

We should note that the non MCMC algorithm for sampling k-colourings of 
Gn,d/m with k = k{d) constant^ in [7], can be transformed so as to estimate the 
log Z{Gn,d/nj in a same manner as in [3]. In this case, it should hold k > d^ 
which is outperformed by this algorithm. 

1.2 Contribution 

As in [3, 17] , the objective of our algorithm is to compute certain Gibbs marginals. 

The time for computing each of these marginals depends on the number of the 
vertices and the structure of the underlying graph G. Usually, this computation 
requires exponential time. Correlation decay is exploited to get polynomial time 
approximations for the marginals. In particular, correlation decay is used to re- 
strict the necessary computations for each marginal to a small subgraph of G 
(i.e. instead of G we consider a small subgraph of G) where the computation 
is carried out in polynomial time. Crucially, the error in the estimation of the 
marginals depends on quantities which express spatial correlation decay. 

In [3, 17] the correlation decay conditions that are used involve imposing 
boundary conditions (fixing the colouring of specific vertices) and showing that 

^ Z\ is independent of the size of G. 
A very large constant that depends on d. 
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its effect on the Gibbs distributions decays as we move away from the bound- 
ary region. This approach is equivalent to estabhshing Dobrushin's Uniqueness 
Condition for infinite graphs (see [10]). 

Here, we follow a completely different approach for the use correlation decay. 
The following example that illustrates the ideas that underlay our approach. 

Example. Consider two graphs Ga = (V, Eg) and Gb = iy, Eb) and the region 

^ C V. Assume that Ea C Eb while Eb\Ea includes only edges of Gb that arc at 
graph distance at least t from if'. Let and /Xb be the Gibbs distributions of the 
fc-colourings of Ga and Gb, correspondingly. For sufficiently large k, t we show 
that the marginal Gibbs distribution of the colourings of that is specified by 
lia and Hb, correspondingly, are very close, in some sense. 

Essentially, the correlation decay conditions we use here involve imposing a 
specific graph structure to some region by deleting edges and by showing that 
the effect of this change on the Gibbs distribution decays as we move away from 
the altered region. 

The spatial correlation decay conditions we use seem to be weaker than those 
in [3,17]. More specifically, we conjecture that our conditions are equivalent 
to what is known as non-reconstructibility conditions^ for the colourings of G 
with a sufficiently large rate of decay (see [10]). We should remark that our 
conjecture would follow by showing that certain non-reconstructibility conditions 
are monotone in the graph structure. For further discussion see Section 3.1. 

Theorem 3.1 and Theorem 3.2 provide the correlation decay conditions needed. 
However, in order to derive the results in Section 1.1 we need to provide appro- 
priate bounds for the spatial correlation conditions. To achieve this we make a 
novel use of "Disagreement percolation" [5]. I.e. express the decay of correlation 
in terms of percolation-probabilities on graphs (see Section 3.2). This technique 
is general and simple, e.g. there is no need for restrictions on the graph struc- 
ture. Furthermore, it allows to express the corresponding bounds in terms of the 
degree of each vertex, not the maximum degree ^. Finally, in the most cases the 
correlation decay bounds will be monotone in the graph structure. 

We should remark that one thing is the correlation decay conditions stated 
by Theorem 3.1 and Theorem 3.2 and an other thing is the bounds provided by 
the use of Disagreement Percolation. 

1.3 Structure of the paper 

The rest of the paper is organized as follows: In Section 2 we present some basic 
concepts and describe the counting problems w.r.t. these concepts. In Section 
3 we present our counting algorithm and provide the theorems that relate its 
accuracy with certain spatial correlation decay. Also, we provide results which 
are used for bounding spatial correlation decay (in Section 3.2). 

® Non-reconstructibility is equivalent to extrimality of Gibbs measure for infinite 
graphs, see e.g. [10]. 

Property which will be exploited in the case of 
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In the Appendix we provide in detail the proofs of the results presented in 
the main part of the paper. In particular, in Section A.l we discuss the technical 
details for applying the counting algorithm on G^.d/n- Wc prove Theorem 1.1 
and other related results. We do the same for the locally a-dense graphs in 
Section A. 2. In Section A. 3 we present some remarks on locally dense graphs 
which provide a better view of this family of graphs. 

In Section A. 4 we prove the results that appear in Section 3.2 for bounding 
spatial correlation decay. In Section A. 5 we give a detailed proof of the theorems 
and the lemmas that have not been proved to that point. 



2 Basics and Problem Formulation 

Let us introduce some notation, first. For the graph G = {V, E) and the integer 
k let f2{G^ k) be the set of all proper fc-colourings of G. Wc always assume that 
k is such that Z{G,k) = \ f2{G,k)\ > 0. Also, for W C V and a G Q{G,k) we 
denote with Q{G, k, a^) the set of proper fc-colourings of G which are consistent 
with a on the colour assignments of the vertices in ^. 

Our algorithm is studied in the context of finite spin-systems, a concept that 
originates in statistical physics. In particular, we use the finite colouring model. 

The Finite Colouring Model with underlying graph G = {V, E) that uses k 

colours is specified by a set of ^\sites" , which correspond to the vertices of G, a 
set of ''spins'\ i.e. the set [k], and a symmetric function U : [k] x [k] — > {0, 1} 
such that for [fc] 



Uii,j) 



1 iii^j 
otherwise. 



It holds that \V\ = n with n finite. 

A confi.guraiAon a E [k]^ of the system assigns each vertex ("sitc")^ x € V the 
colour ( "spin value" ) ax £ [k] . The probability to find the system in configuration 
a is determined by the Gibbs distribution, which is defined as 

where Z{G, k) is the partition function. If and only if cr is a proper fc-colouring 
of G, it holds Y\{x y}eE ^i'^x^^^y) = 1- This implies that the Gibbs distribution 
is the uniform distribution over J?(G, fc) ^. 

Boundary condition corresponds to fixing the colour assignment of a specific 
"boundar]/' vertex set of G; the term "/ree boundar-i/' implies that there is no 
boundary condition imposed. 



For convenience, when we refer to a site of a system we use the term vertex, while 
when we refer to a spin we use the term colour. 
® The definition of Gibbs distribution is valid only for k so large that \0{G, k)\ > 0. 
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For convenience we denote with PCS{G,k,(jxp) the proper colouring model 
with underlying graph G = (F, E) which uses k colours and there is a region 
If" C T/ where wc impose boundary conditions ct^/, for a G f2{G,k). The third 
parameter of PCS-notation is omitted, when we assume free boundary. 

The probability for PCS{G, k, a^) ^ to be in configuration r G [k^ is given 
by the corresponding Gibbs distribution /i"^* , i.e. 

\ otherwise 

where Z{G,k,(T^) ~ \f2{G,k,a^)\. If n, /x'^*' are the Gibbs distributions of 
PCS{G, k) and PCS{G, k, a^p), correspondingly, then for r € [k\^ it holds that 
H-^{t)= ^x{T\ap). 

Another concept we will need is that of the sequence of subgraphs. 

Definition 2.1 (Sequence of subgraphs). For the graph G = {V,E), let 
Q{G) = {Gi — {V,Ei)}l^Q denote a sequence of subgraphs of G which has the 

following properties: 

— Go is some subgraph of G 

— Ei c Ei+\ for Q <i <r and Er = E. 

— the term Gj+i compared to Gi has an additional edge, the edge = {vi,Ui}. 

When we refer to G{G) we specify the graph Go while we, usually, assume that 
there is some arbitrary rule which gives us some specific sequence of subgraphs. 

For the graph G and some integer k such that Z{G, k) > 0, assume that we 
want to compute Z{G, k). Consider a sequence of subgraphs Q{G) such that Go 
is empty, e.g. Q{G) = Go, . . . , G^. For each graph in Q{G) consider the systems 
PCS{Gi, k), for < i < \E\. For these systems, we define the following: 

Definition 2.2. W.r.t. to Q{G) we define the sequence of systems PCS{Go, k),. . . , 
PCS{G, -k)- Furthermore, let Xi denote the configuration of PCS{Gi,k) and Hi 
denote the Gibbs distribution specified by PCS{Gi, k) for Q <i <r. 

For A V, let Xi{A) e [k]^ be the random variable such that if Xi = a, 
then Xi{A) — a a, for any a G [fc]^. 

Lemma 2.1. Considering the all above definitions, the following holds 

\E\-1 

\Z{G, k)\ = fc" • n P^iMvi) 7^ Xi{ui)] (2) 

i=0 

where the vertices Vi and Ui are incident to ^i (see Definition 2.1). 

The proof of the above lemma appears in Section A. 5. 4. 

The above lemma implies that we can have Z{G, k) by computing the proba- 
bility terms in (2). For the cases of graphs we are interested here these probability 

8 Where W<^V wdcj£ n{G, k) 
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terms we have no polynomial-time method for their exact computation. Thus, 
we derive only polynomial-time approximations for this probabilities. 

For the distributions Va, on [fc]^, let H^a — I'bll denote their total variation 
distance, i.e. 



For yl C y let lli'a — J^blU denote the total variation distance between the pro- 
jections of Va and Vb on [k]^. 

Wc close this section with some additional notation. For the graph G = 
{V,E), the vertex v €V has degree Ay and the maximum degree of G is equal 
to A. Also, for ^ C F and some integer t > let L{A, t) = {v & V\dist{A, v) = 
t}, where dist{A,v) denotes the graph distance between A and v. Let, also, 
B{A,t) = Ut'<tL{A,t') and BiA,t) V\B{A,t). Also, w.r.t. to the sequence of 
subgraph Go, Gi, . . . , Gr-i let Li{A, t) = {v G V\dist{A, v) = t in Gj}, similarly 
we define Bi{A,t) and Bi{A,t). 

3 Counting Schema 

We are given a graph G = {V, E) and some integer k such that Z{G, k) > 0. In 
this section we present our algorithm that estimates logZ(G, k). 

Consider a sequence of subgraphs Q{G) = Go,. . . ,Gr with Go being empty. 
For a given integer t and every graph Gi in G{G) consider a, new, sequence of 
subgraphs Q{Gi) = G^^o, • • • , Gi^n defined as follows: Gi^n, is the graph Gi while 
Gj^o is derived from Gj be removing all the edges between the sets Lii^i, t) and 
Li{i'i,t-\-l). We remind the reader that the intermediate terms of the sequences 
of subgaphs are constructed by some arbitrary rule. The value of t is a parameter 
of our algorithm. 

W.r.t. the sequences Q{G) = Go,...,Gr and Q{Gi) = Gift, ... ,Gi^ri, it 
should be clear to the reader what the symbols fii, fiij, Xi, Xij, and for 
< J < and < i < r stand for The counting algorithm follows: 

Counting Schema 

Input: G, k, t. 

Set Z{G,k) = 1. 
For 0<«<r — Ido 

— Compute the exact value of Pr[Xi,o{vi) ^ Xi^lui)]. 




(3) 



Set Z{G, k) = Z{G, k) ■ Pr[Xi,o{vi) ^ Xi^o{ui)]. 



End For. 




Otherwise see Section 2 
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The above algorithm is based on the counting to marginal reduction stated 
in Lemma 2.1, while Pr[Xi{vi) ^ Xi{ui)] is approximated by Pr[Xifi{vi) ^ 
Xifi{ui)]. Note that the computation of Pr[Xifi{vi) ^ Xi_Q{ui)\ is done w.r.t. 
the induced subgraph of Gj which includes only vertices that are within graph 
distance t from tpj. 

The computations for Pr\Xifi(\Ji) ^ Xix)(ui)\ will involve graphs with a num- 
ber of vertices of order either ©(logn) or a 0(n^^*^^)). For the first case Z(G,,o, fc) 
will be polynomially large and the computation of Pr\Xi^Q{vi) 7^ Xj^o(wi)] will 
be carried out by exhaustive enumeration. For the later case, Z{Gifl,k) may 
become exponentially large. There we require Gj^o to have a special structure 
which allows a dynamic programming algorithm to carry out the computations 
in polynomial time. The actual choice of the algorithm is made ad-hoc. 

Two natural questions arise for the counting algorithm. The first one is about 
its accuracy, i.e. how close ^logZ{G,k) and ^\og Z{G,k) are. The second one 
is about the time complexity. It should be straightforward that the execution 
time is dominated by the computations for Pr[Xi^o{vi) ^ Xifl{ui)], for each i. 
When we consider a specific category of graphs we discuss in detail about the 
choice wc make for the algorithm that computes the probability terms. In the 
remainder of this section we will focus on accuracy issues. 

Theorem 3.1. For the Counting Schema it holds that 

r-l 



(4) 



when each of the sumands in the r.h.s. is sufficiently small. 

The proof of Theorem 3.1 appears in Section A.5.1. 

So as to show that the estimation of logZ(G, fc) is accurate, typically, we 
do the following: We derive a constant lower bound for Pr[Xi{vi) ^ Xj(uj)] in 
the denominator. Then, we show that Pr[Xi{vi) ^ Xj(wi)] and Pr[Xifi{vi) 7^ 
Xifi{ui)] are sufficiently close. There we use the following theorem. 

Theorem 3.2. For {) < i < r - I it holds that 

\Pr[Xi{vi) ^ Xi{ui)\-Pr[Xifi{vi) ^ Xi^Q{ui)\\<y^Cij V max \\^J'^o{■\c^x)-|J^ij{■\Ta,)\Wi 

(5) 

where C'ij = Cij{Gij,k) = max^^t^^ki {{Pr[Xij{uij) = s\Xij{vij) ^ t]) ^} 
and ri is the number of terms in the sequence Q{Gi). 

The proof of Theorem 3.2 is given in Section A. 5. 2. 
3.1 Remarks on the Spatial Conditions 

It is interesting to discuss the implications of the spatial mixing conditions re- 
quired by Theorem 3.1 and Theorem 3.2. Assuming that every Cij is a sufficiently 
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small constant, which will be the case here, (5) suggests that 

-\logZ{G,k)-logZ{G,k)\ < f{G,t) ■ .max - My (6) 

where /(G, t) is a quantity that grows linearly with the number of summads in 
(4) and (5). It is direct that a sufficient condition for the counting schema to be 
efficient and accurate is that for every < i < r and < j < to have 

, J|ftj(-|o-x)-ftj(-|Ta:)||i,y({x},t) <C'o-exp{-ao-i} (7) 

for appropriate fixed real numbers Co > and oq > 0. Clearly, the above in- 
equality makes sense when /(G, t) in (6) grows with t slower than the inverse of 
the r.h.s. in (7), i.e. the quantity oq in (7) is sufficiently large. We also have the 
following. 

Lemma 3.1. For any graph G = {V,E) and k, let ji he the Gibbs distribution 
of PCS{G, k). For every x gV and ACV it holds 

^ M^)-IIm(-|A)-M-)IU< max |KM-M(-|T.)IU<2fc. ^ m(A)-|IM-I^)-M- 

For a proof Lemma 3.1 see in Section A. 5. 5. 

By Lemma 3.1 the condition in (7) is equivalent to the following: For Q <i <r 
and < i < rj 

"^^.^.^ M Yl My (^) • llAiy (-l^) -M*i(-)llx < Gexp{-a-t} (8) 

for appropriate fixed G, a > 0. The above is known as non-reconstructibility 
condition for the /c-colourings of Gij (with exponential decay). This condition 
implies that a "typical" colouring of Lij{{x}, t) should have small impact to the 
Gibbs marginal on x. 

Under the assumption that the non-reconstructibility condition is monotone 
in the graph structure, the condition in (8) is equivalent to the condition 

max J2 M^)-|IM-|^)-M-)llx<Gexp{-a.O (9) 

where \i is the Gibbs distribution specified by PCS(G, k). Note that the above 
condition is weaker than the one required in [3, 7, 17] i.e. 

max ||M(-ki:,({c.},t))-M(-)IU < G' exp{-a' • i} (10) 

for fixed real numbers G' > and a' > 0. Note that the above condition requires 
that the effect of any "arbitrary" colouring of L{{x}, t) should have small impact 
on the Gibbs marginal on x. 

To our knowledge establishing such a monotonicity property for the non- 
reconstructibility condition is an open problem. 
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3.2 Bounds for Spatial Correlation decay 



In this section we provide two theorems that can be used to derive upper bounds 
for the quantities | |/Zij(-|cra;) — /x,j(-|Ta;)||a^in (5). The derivation of the bounds 
arc of independent interest than the discussion in the Section 3.1. The technique 
that is used by the two theorems is the well known ^^disagreement percolation" 
coupling construction, [5] . Here, we present only the statements of the theorems 
while their proofs arc presented in Section A. 4. 

Note that the disagreement percolation allows to express the quantities in 
the r.h.s of (5) in percolation-like probabilities, which in turn can be expressed 
in terms of the number of colours k and the degree Ay of each v lE V in Gij . 
To avoid the notation with the indices we consider a general graph G = 
{V,E), instead. 

Let Vq, for q G [0, 1]^, denote a product measure under which the vertex 
V ^ V is disagreeing with probability and non- disagreeing with probability 
1 —Qv A path of disagreement in G is any simple path which has all its vertices 
disagreeing. 

Theorem 3.3. For the graph G = {V, E) let /i he the Gibbs distribution specified 
by PGS{G,k). W.r.t. G, x G V and k let Vq denote the product measure such 
that for V G of degree Ay, we have Qy — while = 1. It holds that 

max ll/^C'Ica;)— /x(-|?7a;)||yi <Vq[3 path of disagreement connecting {x} and A in G]. 

(11) 

where A CV. 

The proof of Theorem 3.3 is given in Section A. 4. 

Let BV{x, A) be the set of all paths connecting x with A such that the vertices 
of each path form an induced subgraph of G which is bipartite. The analysis in 
the proof of Theorem 3.3 suggests that we can restrict the "disagreement" to 
the paths in BV{x,A). For each M e B'P{x,A) let 




1 if M is a path of disagreement 
otherwise. 



Theorem 3.4. For the graph G — {V, E) let fi be the Gibbs distribution that 
is specified by the system PCS{G,k). W.r.t. G and any x e V, consider the 
product m.easure Vq. with q G [0, 1]^, such that for v € V\{a;} of degree Ay, we 
have qy = while q^ = I. It holds that 

IIM-M-m(-|%)IU< Y1 (12) 

where A <ZV and Eq[-] is the expectation taken lu.r.t. to probability measure Vq. 
The proof of Theorem 3.4 is, also, presented in Section A. 4. 

Acknowledgement. The author would like to thank Amin Coja-Oghlan for the 
numerous fruitful discussions and for the encouragement to study this problem. 
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Appendix 



A.l Application - Gn,d/n case 

In this section we show Theorem 1.1 and the results related to G„,rf/„- For techni- 
cal reasons, which we discuss later, require the following sequence of subgraphs. 

Sequence of subgraphs G{G„^d/n)'- Let r be the greatest index in Q{Gn^d/n)^ 
e.g. Q{Gn d/n) = Gq, . . . , Gr- Thc term Go is empty. Let R be the set of all edges 
in Gn,d/n that do not belong to a cycle of length more than iq iog'(^"d/2) they 
are adjacent to a vertex that belongs to such a cycle. There is an index to such 
that for i > iq, Gi differs from Gi-i in some edge from R, in G{Gn d/ri)- 

For < i < r consider that the sequence of subgraphs Q{Gi) defined as 
follows: Gifi is derived by Gi be deleting all the edges that connect the sets of 
vertices Li(^i,a\ogn) and Li{^i,a\ogn+ 1) where < a < 2\o^d - 

Due to thc fact that k is smaller than thc maximum degree of G^^djn 
there can be cases that (Gjj)~^ and Pr[Xj(i'j) ^ Xi(ui)\ are very small. This is 
undesired since it may have bad effect on the accuracy of the counting. 

The proof of Theorem 1.1 implies that this may occur when the vertices which 
are involved, i.e. Vi,Ui, Vij,Uij, have large degrees and belong to small cycles. To 
avoid such cases, for every edge {v, u} that is adjacent to a sufficiently small cycle 
the estimation of Pr[X{v) ^ X(u)\ is set (1 — 1/fc). Working in this way, all the 
"small" cycles that appear Gn^djn become isolated in Gj, for i < io. This forbids 
any case of having exceedingly small either {Cij)~^ or Pr[Xi{vi) ^ Xi{ui)], for 
i < io. Furthermore, the corresponding error introduced in the estimation of 
\og{Z{Gn,d/n,k)) will be negligible. 

Note that we set the parameter t of the Counting Schema equal to ologn, 
for < a < 2k3ili- ^'or such t, the component in G^.o which contains {vi,Ui} is 
w.h.p. a tree with at most 0(logn) extra edges, where Q < i < io- This allows 
the schema to run in polynomial-time. 

Computing Probabilities. The probability term Pr[Xifi{vi) ^ Xifi{vi)], for 
<i < io, can be computed by using Dynamic Programming. 

More specifically, note that using DP we can compute exactly the number of 
list colourings of a tree T. In the list colouring problem every vertex v £T has a 
set List{v) of valid colours, where List{v) C [k] and v only receives a colour in 
Listiv). For a tree on I vertices, using dynamic programming we can compute 
exactly the number of list colourings in time Ik. 

The connected components in Gj^o that contain {vi^Ui\ is a tree with at 
most 0{\ogn) extra edges, for < i < io, w.h.p.. For such component we can 
consider all < fc^Cog") colourings of the endpoints of the extra edges and for 
each of these colourings recursc on the remaining tree. Since in our case fc is a 
constant, A:^('°^") is a polynomial of n. Thus, w.h.p. the numer of list colourings 
of the connected component, in Gi^o, that contains {vi,Ui} can be counted in 
polynomial time for every i. It is direct that this is sufficient for computing 
Pr[Xi,o{vi) ^ Xi,{ui)] 



" The maximum degree in G„,d/„ is O {j^^^^ w.li.p. (see [14]) 
1^ A similar DP approach is also used in [6] and [7]. 
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In what follows, we provide the pseudocode of the counting schema for Gn,d/n- 
Note that, the input parameter e controls the quality of the approximation of 

log(Z(G„,d/„,A;)). 

Counting Schema ^/n 
Input: Gn,d/n, k, e 

If fc < 2.1d compute log(Z(G„_d/„, A;)) by exhaustive enumeration. 

If e < j2-aiog(fe/2c!) compute log{Z{Gn,d/m k)) by exhaustive enumeration. 
Compute the set of edges R. 

If |-R| > nP'^ make an exhaustive enumeration of colourings. 

Compute the sequence of subgraphs G{Gn,d/n)- 

Set Z(G,fc)-(l-i)"". 
For < i < r - do 

— Compute the exact value of Pr[Xifl{vi) ^ Xi^Q(ui)\. 

- Set i(G, k) = Z{G, k) ■ Pr[Xi^o{vi) Xi,o(ui)]. 

End for. 

Set i-(G,fc)==i-(G,fc)-fc". 
Output: log (i(G, fc)) In. 

Theorem 1.1 follows as a corollary of the following two theorems. 

Theorem A. 1.1. For sufficiently large d, k > 2. Id the counting schema com- 
putes a n~'^^ -approximation of log Z{Gn,d/nik) , with probability at least l — n~^^, 
where ei > and e2 > depend on k. 

The proof of Theorem A. 1.1 appears in Section A. 1.1 and makes a heavy use of 
Theorem 3.3. 

Theorem A. 1.2. There are real constants c > and e > such that the time 
complexity for the counting schema to compute the log-partition Gn,d/n is 0{n'^), 
with probability at least 1 — n~^. 

Proof: Theorem follows directly from the paragraph, "Computing Probabilities" , 
above. o 

Furthermore, the proof of Theorem A. 1.1 implies the following theorem too. 
Theorem A. 1.3. For sufficiently large d and k > 2. Id, w.h.p. it holds that 



logZ(G„,d/„,fc) / , d ^ / 1 

^logfc+-.log^l-- 



<o(l). (13) 



If d is relatively sm,all, then the above relation holds w.h.p. when k = 0{d). 
The proof of Theorem A. 1.3 appears in Section A. 1.2. 

Theorem A. 1.4. For fixed d and integer k > 2. Id, there exist a set of graphs 
Sd with Pr[Gn^d/n & Sd] = 1 — n~'^'^ and a function h{n,d,k) = n~^''^^ such 
that for every G & Sd it can be verified in polynomial-time whether the property 



^og Z{G,k) 



(logfc + ^.log(l-i 



n 

holds or not. 

The proof of Theorem A. 1.4 appears in Section A. 1.3 



<h{n,k,d). (14) 
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A. 1 . 1 Proof of Theorem A. 1 . 1 

First we present a series of lemmas and corollaries that will be used for proof 
of Theorem A. 1.1. In essence in these preliminary results we derive appropriate 
bounds for the quantities | (-Ictj:) — Mij("ka;)ll<?'i) C^ij (defined in Theorem 3.2) 
and Pr[Xi{vi) ^ Xi{ui)], for all i,j, by making extensive use of Theorem 3.3. 

W.r.t. to the graph Gn^d/n and the number of colours k, consider the product 
measure Vq, as defined in the statement of Theorem 3.3. 

Definition A. 1.1. Let q{s,d/n) be the following quantity 



M/»..|:^(:)0'(-r^s(:)(:: 



1-^ 

n 



(15) 



Clearly, q{s,d/n) is an upper bound for the marginal probability of the event 
that a vertex in Gn,d/n is "disagreeing" under the measure Vq when s colours 
are used. For convenience, we will use q{s, d/n) with s not an integer, then we 
assume that we have q{\s\,d/n). 

Lemma A. 1.1. Consider the product measure Vq, w.r.t. the graph Gn,d/n O'^d 
number of colours k. Let it be a permutation of I vertices of Gn.d/n, for < I < 
0{n^^^). Conditioning that the maximum degree of Gn,d/n is at most log^ n, for 
sufficiently large n it holds that 

Vq[n is a path of disagreement] < q{k, d/n)- ■{q{k - rf/100, d/n) + n"°-^^^)' (l+o(l)). 

Proof: Call tt the path that corresponds to the permutation tt, e.g. tt = {xi, . . .Xi). 
Let r be such that 



r 



1 if TT is a path of disagreement 
otherwise. 



Note that the probability for G^^d/n to have a vertex of degree greater than 
log^ n is less than n"'^^"^ . It holds that 



Eq[r] ^{^+ ra"'^^"-' j • Eq[r\i'hB path tt appears in G„,d/„]. 
By the fact that (1 + n-'^^"))' < exp {^-"("^ni/^} = 1 + o(l), we get 

Eq[r] < (1 + o(l)) • • £'q[r|the path tt appears in G„,d/„]. 

It suffice to show that for < / < n^^^ and sufficiently large n, that 

£;q[r|the TT appears in G„ d/n] < qik, d/n) ■ {qik - \d/WO] , d) + n'^-^^^Y . 

(16) 

We show (16) by induction on I. Clearly for I = 0, (16) is true, Assuming that 
(16) holds for / = Iq, we will show that it also holds for I = Iq + 1, as well. 
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We assume that the path (xi, . . . , xi^+i) appears in Gn,d/n- For any vertex u, 

let Du, denote the event that the vertex u is disagreeing. Given that all vertices in 
{xi, . . . ,Xiq} are disagreeing let be the number of vertices in y\{a;i, . . . ,xi^} 
that are adjacent to Xj, for 1 < i < Zq- 

Note that if A^^ = t, then all the possible subsets of V\{xi, . . . , xi„} of cardi- 
nality t are equiprobable to be adjacent to Xi. This implies that the probability 

for xig+i to be adjacent to Xi is "^^J'^ 







Let Aig^i be the number of edges that are adjacent to Xio+i and some vertex 
in . . . ,xig-i}. By the linearity of expectation we have 

E[Ai,+,] < -^^E[A,,] < Con-'/^ log' n (17) 

n — lo 

for a sufficiently large real Cq. The last inequality follows from the assumption 
that < Zo < n^^'^, the fact that the maximum degree is at most log' n and the 
fact that is less than the maximum degree, for z = 0, . . . , ?o- 

We make the simplifying assumption that if the vertex xi^+i is adjacent to 
more than d/100 vertices in {xi, . . . ,xig-i}, then it is disagreeing, regardless 
of the number of adjacent vertices outside the path. By (17) and the Markov 
inequality, for sufficiently large n we get that 

Pr[Ao+i > rf/100] < ^n-V^log^n < n'O ^^ 

Remark. Given that {xi, . . . ,xig} is a path of disagreement, the number of 
edges between xi^+i and the set of vertices V\{xi, . . . ,xig} is dominated by 

Bin,d/ '''' 



We denote with E the event that "(^^i, . . . , .t;„) is a path of disagreement. Z\;,,,^^^j < 
d/100, the edge {xi^, xi„+i} appears in Gn,d/n" ■ The following inequality is 
straightforward. 

Pr[D,,g^, \E] < J2 u \ (D - d/nr~' + 



n—j 



< q{k- d/im,d/n). 

The lemma follows. 



Lemma A. 1.2. For sufficiently large d and k > 2.1d, consider the graph Gn^d/n 
and let fi be the Gibbs distribution specified by PGS{Gn,d/m Por every x €V 
and ^ C L{x, I), for < / < 0(n^/^), there exists a real 7 > 1 such that 



E 



max IIM-ka;) -M-|»?cc)lk 



n 



(18) 



where \ is the number of vertices in \If and C is a sufficiently large real constant. 
Note that the expectation is over the graphs. 



Actually it is dominated by B{n — lo, d/n) which in turn is dominated by B{n, d/n) 
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Proof: Consider an enumeration of all the permutations of I vertices in Gn^d/n 
with first the vertex x and last some vertex of ^. Let 'itq{1),tti{1), . . . be the 
permutations in the order they appear in the enumeration. We will say that the 
Z-path i appears in Gn,d/n if and only if G„^d/n contains a path whose vertices 
appear in the order specified by Tri{l). 

W.r.t. the graph Gn,d/nj consider the product measure Vq as it is defined in 
the statement of Theorem 3.3. Let r'i(Z) be the random variable such that 



m) = 



1 the path that corresponds to Tri{l) is a path of disagreement 
otherwise. 



Let, also, r{l) = ^{1). By Theorem 3.3 we have that for a given instance of 
Gn^d/n it holds 

Taking the average over both the graph instances and the product measure Vq 
we get that 



E 



max \H-Wx) - K-MIW 
(7,77er3(G„,<j/„,fc) 



<j2E[m)]- (19) 



Lemma will follow by bounding appropriately the r.h.s of (19). 

For the moment assume that the maximum degree in Gn,d/n is at most 
log^ n. Later in this proof we remove this assumption. We should mention that 
the probability for the event that the maximum degree in G^^d/n is at most 
log^ n is at least 1 — n~'^^"^ 

Using Lemma A. 1.1 and the fact that there are less than \^\ ■ n'~^ paths of 
length I, between x and ^, we get 

Eq[r{l)] = J2 Eq[m)] < • {q{k - d/100, d/n) + n-O'^^^y _ 

i 

Setting s = k — d/lOO, where k > 2A • d we get 
q{s, d/n) ^ E ('^/")'(1 - '^Z^)""' + E ('^/^)' (1 - '^Z")""' 

^^E • (W(i-w-+ E C 



3=0 j=s/2+l 

< -+z{s/2). 



. ]{d/ny{l-d/ny 



s 



The quantity z{s/2) is equal to the probability for a random variable distributed 
as in B{n, d/n) to be greater than s/2. Using Chernofi^ bounds and for sufficiently 
large d, it holds z{s/2) < Thus, we get that q{s,d/n) +n~°-^^ < 

Eq[r{i)] < ^(1.04)-'. 

With similar arguments we get that the expected number of paths of disagree- 
ment of length I that start from x and get to any vertex in Gn^d/n is (1.04)"'. 
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Let Eg = "there is no path of disagreement that starts from x and has length 
iog(il34) "w^here (7 > is a sufficiently large constant. By the Markov Inequality 
we upper bound the probability of the complementary of Eg, i.e. Vq[Es] < . 
Thus, Vq[Es] > 1 - n-^. Also, it is easy to see that Eq[r{l)\Es] < 2Eq[r{l)]. 

Let r be the number of paths that start from x, have length at most i^|°^04) 
and connect the vertex x with ^. Then 

Clog„/log(1.04) 

Eg[r\E,] < 2^ Yl ^iim)] < Ci^ (L04)-^'°s" 

l=€ logn 

where Ci is a sufficiently large positive constant. Call D the event that the 
maximum degree in Gn^d/n is less than log^ n. Let 

^= max ||/x(-|o-^)-/x(-|?7^)||a)-. 

cr,7/ei7(G,fe) 

It holds that 

E[S]=E[S\ D, E,] Pr[D, E,]+E[S\DU E,] Pr[D U E,]. 

Clearly 

E[S\D,EJ<E[r\D,EJ<Ci^-^(lM)-'^°^'' . 

n 

Also, it holds that E[S\DuEs] < 1. Thus we get that 

E [S] < Ci^ (1.04)^^'°^" + n-'^^") + n-^. 
n 

Lemma follows by taking sufficiently large C. o 



Corollary A. 1.1. The bound for the expectation in (18) holds even if we remove 
an arbitrary set of edges of Gn^d/n- 

Proof: This holds since we have used Theorem 3.3 whose bounds are increasing 
in the degrees of the vertices. <> 



Lemma A. 1.3. With probability at least 1 — n the number of edges in R 
are at most n^'^, while the shortest cycle in G^^jn which contain each of them 
is of length greater than ioiogfe"d/2) • 

Proof: Note that if some edge in R belongs to a cycle of length less than e log n, 

for e = 1/(10 log(e^fi/2)), in Gn d/n^ then must exist a subgraph of G^^d/n that 
contains at most 2elogn vertices and has at least two cycles. 

In turn this implies that there is a set of vertices in Gn,d/n sharing with each 
other a number of edges that exceeds by 1, or more, the number of vertices. 
The size of this set is at most 2elogn. We are going to show that such a set of 
vertices does not exist in G^^dln with probability at least 1 — n~°'^. Let D be 
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the event that in Gn^/n there exists a set of r vertices which have r + 1 edges 
between them. For r < c log n we have the following: 

Pr[D] < J {dlnr\l - d/n)(^)-("+i) 

clogra / 9 \ r+l clogn , 9 ,\ r 

- n V 2 j 

Having c • log(e^(i/2) < 1, the quantity in the last derivation is o(l), i.e. it is of 
order 0{n'^^°^'^'' '^Z'^)^'^). Tims, for e as large as defined in the statement there 
is no connected component in Go which contains two cycles with probability at 
least 1 — n""'^ 

With standard probabilistic arguments we get that the expected number 
of cycles of length I in Gn^d/n l<2ss than S . By the linearity of expectation 
we conclude that the expected number of cycles of length at most elogn, is 
at most 0{ri}/^'^). The expected number of vertices in these cycles is at most 
0{ji}/'^'^ \ogn). All these imply, also, that the expected number of edges in R is 
at most 0{n^Z^^ \ogn). 

Applying the Markov inequality, we get that the probability of the event that 
the number of edges in R is at least rv^^^^ is upper bounded by n~^-^/^°. The 
lemma follows. o 



Remark. Lemma A. 1.3 implies that removing the edges in R, w.h.p. all the 
cycles of length less that lo io^gfe"d/2) become isolated. 

Lemma A. 1.4. Consider the sequence of subgraphs Q{Gn_d/n), defined in Sec- 
tion A.l. W.r.t. G{Gn^d/n) and when k is constant, with probability at least 
1 — n~'^, every Cij, for Q < i < io, defined in the statement of Theorem 3.2, is 
upper bounded by constants, c> is a constant as well. 

Proof: Considering the sequence of subgraphs Gift, . . . Gi^n, Uij and Vij are such 
that they are not adjacent in Gij while they become adjacent in Gij+i. By 
Theorem 3.2, we have that 

Cij = Cij{Gij,k) = max {{Pr[Xij{uij) = s\Xij{vij) = t])"^} . 

s,f6[fcj 

Note that expresses a correlation between the colour assignment of the 
vertices Vij and Uij in PCS{Gij,k). We will show that Cij is small by com- 
paring Pr[Xij{uij) = s\Xij{vij) = t] with Pr[Xij{uij) = s] and by showing 
that these two probability terms does not differ very much. It holds that 

\Pr[Xij{uij) = s\Xij{vij) = t]-Pr[Xij{uij) = s]\ < max 1 1«„ 

Lemma follows by showing that with sufficiently large probability it holds that 
max Wl^iji (■hvi.jOlky ^ inriT. for every (20) 
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Furthermore, there is a set of edges £ in Gn^/n such that for every e e 5 with 

e = {u>i , W2} there is set of pair of indices le x Je such that for (i, j) £ /g x Jg it 
holds Cjj = Pr[Xi^j(Mjj) = s|Xjj(tij^j) = t] where either Vij = wi and Uij = W2 
or Vij = W2 and Uij = wi . 

Corollary A. 1.1 implies the following: For every edge e G 5 such that the 
relation in (20) holds for all members of /g x Jg if it holds for the Gibbs distri- 
bution of PC5(G„^d/„\{e}, fc), with an abuse of notation we use G„,d/„\{e} to 
denote the graph Gn^d/n without the edge {e}. 

Due to the way we consider the edges in the schema the set of edges £ 
contain only edges such that the smaller cycle that contains them, in Gn^d/m is 
of length greater than elogn, where e ~ {lQ\og{e^d/2))~^ . Thus, for e G 5 with 
e = {x,y}, the vertices x,y are at distance at least 10 iog'fe"d/2) Grt,d/n\{e}- 
Thus, by Lemma A. 1.2, it holds 

~ n 



E 



max ||/i(-|(T^)-/x(-|?7^ 

o-,77ef2(G„_d/n,fc) 



Applying the Markov inequality and for fixed C we get 
Pr " ■ ^ 



max ||/i(-|«^x) -M(-hx)||j/ > -,„„, 



Using Chcrnoff bounds it is easy to show that with probability at least 
1 — n~'^("^ the number of edges in G^^d/n is at most ^(1 + o(l)) and con- 
sequently \£\ < 0{n) with the same probability. Lemma follows by taking the 
union bound over all the pairs x,y gV such that {a;, y} € £. o 



Lemma A. 1.5. Consider the sequence of subgraphs Q{Gn.d/n)j defined in Sec- 
tion A.l. W.r.t. Q{Gn,d/n) O'lT'd' when k > 2.1d all the probability terms Pr[Xi{vi) ^ 
Xi{ui)] are lower bounded by a constant, with probability at least 1 — n~'^ for c 
constant. 



Proof: We consider two cases for Vi and Wj: First, consider that the smallest cycle 

which contains both vi and ui is of length greater than ioio'g(^."d/2) • Second, we 
consider the opposite i.e. Vi, Ui belong to a small cycle. For the first case we note 
that 

\Pr[Xi{ui) ^ Xi{vi\ - (1 - ^) I ^ ^) - I^Mnvi)\\ui- 

We work in the same manner as in the proof of Lemma A. 1.4. 

In the later case, note that we consider Vi and Ui which belong to small cycle 
only when we have already removed the edges in R, in Gj. Then, all the small 
cycles become isolated. Thus, for the second case, where the vertices belong to 
a small cycle, Pr[Xi{ui) 7^ Xi{vi\ is trivially constant, since k » 2. Lemma 
follows. o 

Using Lemma 2.1 and the previous lemma we get the following corollary. 
Corollary A. 1.2. The log-partition function of Gn,d/n w.h.p. 0{n). 
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We have all the lemmas we need to show Theorem A. 1.1. 



Proof of Theorem A. 1.1: By Theorem 3.1 the error in the estimation depends 
on the sum 

According to Theorem 3.2, for < ^ < io it holds that 

\Pr\Xi{vi)i^Xi{ui)\-Pr\Xi^(i{vi)i^Xi^(i{ui)\\<S^Cij max ||Mij(-|o'x)-Mij(-|Tx)||i'i. 

3=1 xeiFij 

For i > iQ the schema makes the estimation of Pr[Xi{vi) ^ Xi{ui)] without 
computations. By Corollary A. 1.1 and by noting that Hi,ri{-) = we have 
that 

\Pr[Xi{vi) 7^ Xi{ui)]-Pr[Xifi{vi) Xi,o(wi)]| < max(Cij)-rr max \\lJ,i{-\ax)-Hi{-\Tro)\\^i 

where a; is a vertex at distance a log n from tf/j . Using Lemma A. 1 . 2 and Corollary 
A. 1.1 we get 



E 



max llMi(-lo-x) -/Xi(-|rx)|k^ 



< C2n-(i+"^°s7) (21) 



where < a < 2iogd > 7 is a fixed real greater than 1 and C2 is a sufficiently large 
constant. Wc remind the reader that the distance between x and ^ is alogn. 

The number of terms in the sequence Q{Gi) is equal to the number of edges 
between Li{^i, alogn) and Li{^i, alogn -\- 1). The number of vertices of a BFS 
tree of Bi {^i , a log n) is dominated by a Galton- Watson tree of a log n levels, with 
number of offsprings per individual distributed as in B{n,d/n). With standard 
arguments (e.g. see Theorem 6 in [18]), it holds that with probability at least 
1 — o(n~^), the number of vertices at level alogn, where < a < 2\ogd '' 
5n^/^ logn. Using Chernoff bounds we can show that the number of edges are 
©(n^/^ logn) with probability at least 1 — o(n~^). Thus ri = Qiv^l"^ logn) with 
probability at least 1 — o(n~^). 

Let D be the event that: a) max^j (Ci^j) is constant, b) maii{Pr\Xi(vi) ^ 
^i(uj)]} > 0.99/fc c) maxi{rj} < ©(n^/Mogn) and d) = n°-3. By previous 
lemmas we have that there is a constant e > such that Pr\D\ > 1 — n~*. 
Combining this fact with (21) we get 



E 



max ||rt(-|o-x) - Mi(-kx)|ki I -D 

<T,rei7(Gi,/c) 



< Cn-(^+"'°s7) 



for a sufficiently large C. Assuming that the event D holds, it also holds that 
\Pr\X,{vi) ^ X,{ui)\ - PT\X,^^(yi) Xi,o(«0]l 



Pt\X,{vO + X,{ui)\ 
Thus, by Theorem 3.1 we get that 
1 



= 0(1) for i > io. 



E 



^logZ{G,k)-logZ{G,k)\\D 
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Using the Markov inequality we get the following: Applying the schema on a 
graph Gnjijn where the event D holds, then with probability at least 1 — n~^/^ 
it holds that 

-|log^(G,/c) -logZ(G,A;)| < 6>(n-("'°s'>') logn) 
n 

Theorem follows by the fact that Pr[£)] > 1 — and Corollary A. 1.2. o 



A.1.2 Proof of Theorem A.1.3 

We will be based on Lemma 2.1 to show the theorem. 

Consider the following sequence of subgraphs ^(G„,d/rt) (different than the 
one stated in Section A.l): The term- graph Go is empty. There are indices io 
and i\ with < i\ while the followings hold: For < i < Zi, Gj contains all 
the edges that belong to cycles of length at most lo iog'fe"rf/2) ^n.rf/" '^^^'^ only 
these edges. Additionally, for < ?' < ?'o. G,; consists only of isolated vertices 
and connected components which are paths. We refer to the cycle of length less 
ioio'gfe"d/2) ^ "small cycles" 

Working as in the proof of Lemma A.1.3 we get the following: With proba- 
bility n~'^, for fixed c > 0, it holds that the number of edges in small cycles is 
n°'^ and all these "small cycles" do not share vertices with each other. 

By Lemma A.1.2 for PCSiGnAjm k), where k > 2. Id, for every x G V and 
C L{x, I), where < ^ < n^/^, there exists a real 7 > 1 such that 



E 



max \\^x{-\a^) - n{-\r]^ 



< C ■ -(^7-' (22) 
n 



where is the number of vertices in W and G is a sufficiently large real con- 
stants. By (22) and the Markov inequality we get that with probability at least 
n~^S it holds that 

for i > ii where ci > 0, €2 > are fixed reals. From the above inequality we get 
that for i > ii 

and 

\Pr\Xi{ui)i^2\Xi{v,)=3\-Pr{X,{u,)i^3\\ < M'WvJ - K'lVvMu,- 
Thus, we get that 

Pr[Xi{vi) Xi{ui)] = (^1 - (1 ± 0(1)) 

Note that the above relation holds for almost all values of i in {1, . . . ,r}. E.g. 

there is a set of values for i, call it ^ C {1, . . . r}, for which the above does not 
hold while \A\ = o{n). For < i < ii we assume that 



Pr[Xi{vi) Xi{ui)] 
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The error that in the estimations for < i < ii is constant. Note that all {vi, Ui}, 
for < i < ii belong to isolated cycles in Gi, of length less than (ioio'gfe"d/2)) • 
Furthermore, the error in the approximation of the log-partition is not affected 
by more than o(l), because ii = o{n). It is direct that it holds Z{Go,k) = fc". 
Theorem follows. 



A. 1.3 Proof of Theorem A. 1.4 

Let G{G) be the sequence of subgraphs we consider in the proof of Theorem 
A. 1.3. It is direct that the theorem will follow by showing that we can verify 
that 

for i > ii and appropriate fixed ei > 0, in polynomial time. Note that for Q{G) it 
holds that dist{vi,Ui) > ioio'^(^e"d/2) ' ^'^^ i > ii - Using Theorem 3.3 and Corollary 
A. 1.1 we have that 

^ 27^q[ 3 path of disagreement connecting {vi} and {ui} in 

where Vq is the the product measure defined in the statement of Theorem 3.3 
w.r.t. Gn,d/n and k. Consider the event 

Ey^f^ = "3 a path of disagreement that connects v with L(i',elogn)" 

for some fixed e > (101og(e^d/2)). For each pair Vi Ui define 



min( ^^f""^-\ (41og(e^d/2))-^V 
I log n J 



Noting that, for fixed ci, C2 such that ci > C2 it holds that VqlEy^a] < T^q[Ev,c2] 
we get that 

Vq[3 path of disagreement connecting {vi} and {ui} in G] < Vq[Ey^ai] 

It remains to show that VqiEy.^ai), for i > ii, can be verified that are sufficiently 
small in polynomial time. Let r„,_a, be the set of all simple paths that connect 
Vi to L{vi,ai log n), it holds that 

Vq[''m is a path of disagreement"] 

The computation of each probability term in the r.h.s. of the above inequality 
can be carried out in polynomial time. 

Furthermore, using Lemma 2.1 from [7] wc have that for for every Vi G 
Gn,d/n the graph B{vi,ailogn), is tree with at most one additional edge, with 
probability at least 1— n~°'^. It is direct that the number of simple paths between 
V and L{vi, ailogn) in B{vi, ailogn) is at most 2|L(i'i, logn)|. 

With standard arguments (e.g. see Theorem 6 in [18]), it holds that with 
probability at least 1 — o{n~^), \L{vi,ailogn)\ = n^'^^logn. Thus, |T„;,ai| is 
polynomially large w.h.p.. It is direct that the above sum can be computed 
in polynomial-time. Theorem follows by noting that with probability at least 
1 — the number of terms in Q{G) are G{n). 
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A. 2 Applications - Locally dense graphs 

We consider G = {V, E) to be a locally a-dense graph with bounded A and 
some a G [0, 1]. For a locally a-dense graph G, the cardinality of BV{v,A), for 
some V aV and /I C is decreasing in the parameter of the model a. Based 
on this remark and considering Theorem 3.4, we show that the r.h.s. of (12) is 
decreasing in the distance between v, A as long as A; > (2 — a) A. 

When we consider Q(G), note that deleting edges in G arbitrarily, we may 
increase the number of path between v,A whose vertices form an induced sub- 
graph which is bipartite. In terms of accuracy of counting, this is a undesirable 
event. The sequence of subgraphs we consider below is taken so as to control 
the cardinality of the set BP{vi,A) w.r.t. to the graph d. We denote this set as 
BVi{vi,A). 

Sequence of subgraphs. Let r be the number of terms in Q{G). Wc are going 
to describe the terms of the sequence of subgraphs in reverse order, i.e. starting 
from Gr and getting backwards. Clearly Gr is the same as G. Then choose some 
vertex of Gr, arbitrarily, let w be that vertex. In each of G^-i, Gr-2, ■ ■ ■ remove 
only edges which are adjacent to w. When w becomes isolated, choose another 
vertex and continue in the same manner until all vertices become isolated. 

For < i < r consider the sequence of subgraphs Q{Gi) as follows: Choose 
some vertex at distance t from let Wi be that vertex. For Gi^niGi^n-i ■ ■ ■ ■> 
remove only edges which are adjacent to Wj. Then, choose some other vertex at 
distance t from and continue in the same manner, until there are no vertices 
at distance t from 

Computing Probabilities. Set the parameter of the Counting Schema t = 
c log log n, for sufficiently small c > 0, such that the size of the connected com- 
ponent that contains {wi, Ui] in Gi^o, has at most 0{\ogn) vertices, for < i < r. 
For such component the number of colourings is polynomially large, i.e. it is at 
most fc'S'(iog")_ In this case it is direct that Pr[Xifi{vi) ^ Xifi{ui)\ can be com- 
puted by exhaustive enumeration. 

It turns out that we get a (logn)~^*^^''-approximation of log(Z(G, k)). At the 
end of this section we will see cases where the approximation of log(Z(G, k)) can 
be improved. 

In what follows, we provide the pseudocode of the counting schema for a 
locally a-dense graph G. The input parameter e controls the quality of the ap- 
proximation of log(Z(G, k)). Let k be of the form k= {1+ (3) A for some /3 > 0. 

Counting Schema G 
Input: G, /3, t, e 

If e < ((1 — a)/ PY compute log(Z(G, fc)) by exhaustive enumeration. 
Compute the sequence of subgraphs G{G). 
For < i < r do 

— Compute the exact value of Pr[Xifi{vi) ^ Xi^o(wi)]. 

- Set Z{G, k) = Z{G, k) ■ Pr[Xi^Q{vi) Xi,o(ui)]. 

End for. 

Set i-(G,fc) = Z{G,k)-k''. 
Output log (z{G, k)) /n. 
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Theorem A. 2.1. Consider G = (V, -B) a locally a-dense graph of bounded A. 

The Counting Schema when is applied to G with parameters fc = (1 + f3)A and 
t, for P such that 1 < a + /3, it returns an approximation of the log-partition 
function such that 



- log Z{G,k)- log Z{G,k) 

where C is a sufficiently large real constant. 



<C 



1 — a 



The proof of Theorem A. 2.1, makes a heavy use of Theorem 3.4, is presented in 
Section A.2.1. 

Note that for /3 such that 1 < a + /3, or equivalently, for k > {2 — a) A, the 
quantity is less than 1. 

Corollary A.2.1. It holds that log Z{G, k) = 0{n), for A, k fixed and k > A. 

Proof: Corollary follows by Lemma A. 2. 2 we provide for the proof of Theorem 
A.2.1 o 



Theorem 1.2 follows as a corollary from Theorem A.2.1, Corollary A.2.1 and 
the previous paragraph titled Computing Probabilities 

Improved accuracy. Taking t = clogn for sufficiently small c > 0, the size 
of the connected component that contains {vi,Ui} in Gi^, may become poly- 
nomially large. In this case the number of fc-colourings of Gi^o can be exponen- 
tially large and exhaustive enumeration is prohibited. However, if this component 
has bounded treewidth, then the junction tree algorithm (see [23]) can compute 
Pr[Xifi{vi) ^ Xifi{ui)] in polynomial time. In this case. Theorem A.2.1 and 
Corollary A.2.1 specify that we get a n~^^^^ -approximation of \og{Z{G, k)). 

The junction tree algorithm is a general algorithmic framework which allows 
calculations of marginals in settings as the one here. The detailed description of 
the algorithm is beyond to scope of this paper. For more details on the subject see 
[23]. We should, only, remark that given a tree decomposition of the component 
that contains {vi,Ui} in Gi^o, of width TR, the junction tree algorithm computes 
Pr[Xifi{vi) ^ Xifi{uif\ in time 0{n ■ k'^^). Note that the algorithm of Reed in 
[20] returns a tree decomposition of a graph Go of width at most TR (for fixed 
TR) if there is any. The time is needed is 0{N log N), where A'' is the number 
of vertices of Go- Corollary 1.1 follows. 



A.2.1 Proof of Theorem A.2.1 

Lemma A.2.1. All the quantities Gij defined in the statement of Theorem 3.2 
are upper bounded by some constant when k > A, and both k, A are fixed con- 
stants 

Proof: For any t G [k] it suffice to show that 

min {Pr[Xij{uij) = s\Xij{vij) = t]} < G'^ max{Pr[Xij(ui,j) = s\Xij{vij) = t]} 
se[k] se[k] 

(23) 
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where C is a sufficiently large constant. Note that in Gij the minimum graph 
distance between Vij and Uij is 2. For given t S [k] let sm G [k] be such that 

max Pr[Xi J {uij) = s\Xij{vij) =t] < Pr[Xij{uij) = SM\Xij{vij) = t] 

se[k] 

and similarly let s„j G [k] be such that 

mmPr[Xij{uij) = s\Xij{vij) = t] > Pr[Xij{uij) = Sm\Xij{vij) = t]. 
se[k] 

Let T e Q{Gij,k) be such that r^. ^. = For a e f2{Gij,k,Tvij) and s G [A;] let 

^(s, a) be the number of colourings of G^.j where ^ is coloured as s and the 
vertices in Bij[uij,2) are coloured as cr^. ^(m^ ^,2)- It holds that 

and 

Pr[Xij{uij) = Smks 2)] = ^r,(r ^hT^^ \\ 

\i2{Gij,k,aB..^^..^2))\ 

Which implies that 

Pr[Xi^jiUi,j) = Sm|0-B..,(u..,,2)] _ A{Srn,(7) 

Pr[Xij{uij) = SM\(TB,^^{ui,i,2)] A{sm,(t)' 

Clearly A{sm,<^) and A(.s„i,c7) depend on how many colourings arc available 
for the vertices in Lij{uij, 1) given that the vertices in Bij{uij,2) are coloured 
as crB-.(^uij,2) s-'^d the vertex Uij is coloured as Sm and Sm, correspondingly. 
Since k > A and both k, A arc fixed numbers, both A(,sm, c) and A{sm, a) are 
bounded and non zero. This implies that there exists a sufficiently large constant 
C that 

for any (^Bi j(ui j,2) ^ f2{Gij,k,Tv^^). Multiplying both sides in the above in- 
equality with Pr[Xij{Bij{uij,2)) = cr^. 2)] and taking the sum over all 
possible (Tb. .(„. ._2) we get that 

Pr[Xij{uij) = SM\Xij{vij) =t\<C ■ Pr[Xij{uij) = Sm\Xij{vij) = t] 

Lemma follows. <> 



Lemma A. 2. 2. For < i < r it holds that Pr[Xi{vi) ^ Xj(uj)] is lower 
bounded by some constant when k > A, and both k, A are fixed constants. 

Proof: Very similar to the proof of Lemma A.2.1. o 



Definition A.2.1. For some positive integer I, let BVi{^i,A,l) C BPi{\l/i,A) 
contain only the paths in BVii^i, A) which have length I. 

In the following lemma when we refer to a sequence of subgraphs Q (G) where 
we delete the edges as described in Section A. 2. 
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Lemma A. 2. 3. For any graph G = {V,E), consider the sequence of subgraphs 
Q{G). Assume that Q{G) = Go, ■ ■■ , Gr- For <i <r and ACVit holds that 

\BVi{^i,A,l)\< \l3Vr{w,A,l-l)\ 

where N^. is the set of neighbours of in Gr and I is an positive integer. 

Proof: For every vertex u £ let Gi^. be the term of G{G) which has the greatest 
index among the terms of G{G) having the vertex v isolated. 

Consider the permutation of the vertices in G, e.g. vi, . . . ,Vn, such that, the 
vertex Vi is the ith vertex that becomes isolated as we look Q{G) from the term 
r and backwards. Ties are resolved arbitrarily. For any vertex v €V and ACV 
it holds that 

\BVr{v,A,l)\ > \BVi^^{v,A,l)\. (24) 

For any simple path in Gr, that start from v, ends in A and contains vi, the 
removal of vi will just disconnect it. Clearly this implies that removing vi and 
all its adjacent edges from Gr, does not increase the cardinality of the paths 
that connect v with A such that their vertices form an induced subgraph which 
is bipartite. Repeating the above argument inductively we get that 

\BVi^^{v,A,l)\>\BVi^^^^iv,A,l)\ (25) 

for s = 1, . . . , n— 1. Note that > iv^+i ■ Consider, now, the graph Gj, in G(G), 
for some j G [ivs+i, ■ ■ - iv^ — It holds 

\BPj{1rj,A,l)\< \l3Pi^Aw,A,l-l)\. 

3 

Note that in the above inequality, each summand in r.h.s. allows to use the 
vertices in Wj. Lemma follow by noting that (25) implies 

\BPj{iyj,A,l)\< Y \13Vr{w,A,l-l)\. 

3 

o 

Proof of Theorem A.2.1: Let M = (xi . . . , xi) such that M G BVriixi}, {.tJ, l- 
1). Assume that we want to add in M a vertex xi+i such that the new path 
M' = (xi, . . . , xi+i) to belong to BVriixi}, {xi+i}, I). M can be extended only 
using vertices that are not adjacent to Since Gr is locally a-dense, from xi 
there are only, at most, (1 — a)A available vertices. 

Note that if the path could be extended to a vertex in the neighbourhood of 
X(_i, then xi and the new vertex would form a triangle. This would imply 
that M' ^ BPr{{xi},{xi+i},l). Consequently, the number of paths that start 
from V gV have length / and their vertices form a induced subgraph of Gr which 
is bipartite is, at most, A ((1 — a)Ay~^ . 

Theorems 3.2 and 3.4 state that 

\Pr[Xi{vi)J^Xi{ui)]-Pr[Xi,o{vi)J^Xi,o{ui)]\<J2C^J ^ ^ Eg[lM]. 

3 = 1 xelfi.j MeBVij{x,^i) 

(26) 
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The expectation in the r.h.s. is taken w.r.t. to product measure Vq specified by 

G and k. Let V'^ be the product measure specified by Gi for some i and k. It 
is direct that Vq dominates stochasticahy V'^, in the sense that some vertex is 
more probable to be disagreeing under the first measure. Consider the quantity 

BP{i) = j2\m,jmj,^ui)\- 

Using Lemma A. 2. 3, we get that 

\BVi,j{W'ij,^i,l)\< J2 \l3Vi{w,^i,l-l)\ 

Let L = U*lt_2Lr{^i,l), it holds that 

BP{1) < \BV^{w,^,,l-l)\ 
weL 

Noting that B'Pi{w,^i) = BVi{^i,w) and applying Lemma A. 2. 3 we get that 

\BPi{^i,w,l-l)\< J2 \BVr{y,w,l-2)\. 

Thus, for / > i (i is a parameter of the Counting Schema) we get that 
BP{l)<Y, \l3Pr{w,x,l-2)\<2A''{{l-a)Af-^ . 



We can write (26) as follows: 

I 

k-A. 



\Pr[Xi{vi) Xi{ui)]-Pr[Xi,o{vi) + Xi,o(ui)]| < maxC^ ^BP(/) f rr 



i>t 



For fc = (1 + /3)Zi we have 



1-1 



\Pr\X,{v,) ^ XiK)] - Pr[Xi,a{vi) + X,^^{u,)\\ < ^^^^^ g 

Taking 1 < a + P the ratio (1 — a)/f3 is smaller than 1. In this case we got: 
\Pr[Xi{vi) + Xi{ui)\ - Pr[Xi,oK) ^ Xi,o(«i)]l < ^^^^^ Ei>t ( 



1-1 



< Tii 



1 C^-aV 
1-1- • I 6 J 



Plugging in Lemma A. 2. 2 and Lemma A. 2.1 we get that for < z < r 
\Pr\XiivO + X,[u,)\ ~ PT{X,.^{vi) ^ Xi,oK)]| < c ^1 - ' 



Pr{X,{vi) ^ X,{u.;)\ ~ \ 13 

where C is a sufficiently large constant. Theorem follows by using Theorem 3.1. o 
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A. 3 Remarks on Locally a-dense graphs 



The following lemma provides a criterion under which a graph is locally a-dense. 
It is not too confining and allows a rich family of locally dense graphs. 

Lemma A. 3.1. Consider the graph G = {V,E) with maximum degree A. For 
each V G V let Qy = {Qi, Q2, ■ ■ ■ , Qs} denote the set of all maximal cliques that 
V belongs. If for every v gV and for every i G [0, . . . ,\Qy\] it holds 

E \Qj\Qi\<[{l-a)A\ (27) 
for some real a e [0, 1], then G is locally a-dense. 

Proof: Consider the graph G = {V, E) and some {u, u] G E. Note that the 
neighbours of v that are adjacent to u are only those that belong to the union of 
the maximal cliques that contain both v and u. Assume that one of the cliques 
that u belongs is Qi. Then the summation in (27) gives an upper boimd to the 
number of the neighbours of v that are not adjacent to u. This bound is exact 
if and only if, among the cliques inQy, u belongs only to Qi, otherwise it is an 
overestimate. Clearly, if for every v and i this sum is less than [(1 — Oi)A\ , then 
G is locally a-dense. o 

It is easy to check using the previous lemma that if G is a locally a-dense, 
there can be vertex v having a triangle free neighbourhood as long as its degree 
of V and the degrees of its adjacent vertices are at most (1 — a)A. Also, note 
that if the inequality (27) is not true, then G may still be locally a-dense. 



A. 4 Bounds for spatial correlation decay 

In this section we present the proofs of Theorem 3.3 and Theorem 3.4, stated in 
Section 3.2. 

For some finite graph G = iy, E) and some integer k such that f2{G, k) > 0, 
let /u(-) be the Gibbs distribution specified by PCS{G, k). For x eV, ACV and 
o;ri G fi{G, k), we are interested in deriving upper bounds for following quantity 

llM(-kx)-M(-|??x)|U. (28) 

Towards bounding the quantity in (28) we introduce two random variables X'^ , 
G [k]^ distributed as in iJ.{-\ax) and At(-|%), correspondingly. For any \1/ CV 
let X^i^) G [kf (X^i^) G [kf) be the random variable such that ii X" = ^ 
[X-^ = 0, then X-^i^) = (X''(ip') = ^^). We couple X" and X^i and use the 
Coupling Lemma (sec [2]) to bound (28) as follows: 

IImOx) - m(-|^x)IU < Pr[X''{A) X^{A) in the coupling]. 

We provide a upper bound for the probability of the event "X'^(yl) 7^ X^(A)" 
in the coupling, in terms of k and the degrees of the vertices in G by using 
^^disagreement percolation" , [5]. 

The identity coupling of X"' , X'^ is precluded due to the disagreement at 
X. The "disagreement percolation" coupling construction assembles the coupling 
(we have provide at the first place) in a stepwise fashion working away from x. 
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In our case it may be viewed as follows. The disagreement propagates into G 

along paths from x. A disagreement at vertex y E V at graph distance I from 
X propagates to a neighbour z at distance I + 1 if X"{z) ^ ^^{z)- A path of 
disagreement in G is any path which has all its vertices disagreeing. 

The disagreement percolation is dominated by an independent process . As 
we will show, the disagreement propagates along a self-avoiding path M (that 

starts from x) of length I, with probability at most nt)ex\{a;} k-A ' "-"^ (f^) • 
In the following subsection we describe the coupling of X"^ and X^. After- 
wards we present the proofs of Theorem 3.3 and Theorem 3.4. 



A. 4.1 The coupling for the comparison 

For coupling X'^ and X^, we exploit the bijection between Q{G,k,<Jx) and 
Q{G, k, r/x), provided in the following lemma. 

Lemma A. 4.1. There is a bijection T : f2{G,k,(jy) f2{G,k,r]x) such that 
for every ^ G Q{G,k,<Jx), the set = e V\£,y ^ {TClv} has the following 
properties: 

- for every v G it holds {TS,)v e {a^, ri^} 

— the graph G^ = (V{,-Ej), induced subgraph ofG, is connected and bipartite. 

Proof: The proof of this lemma is based on describing the actual bijection T. 
More specifically, we demonstrate how T maps a given member of f2{G, k, ax) to 
a specific member of k, rjx)- Then we show that the mapping T is actually 
a bijection between the sets n{G,k,a-x) and n{G, k,rjx) while the set Vj, as 
defined in the statement, has all the desired properties. 

For the colouring ^ e f2{G,k,ax), let G^ = {V^,E^) be the maximal, con- 
nected induced subgraph of G such that x G and Vw G it holds G 
WxtVx}- It is easy for someone to see that is a bipartite graph (in the ex- 
treme case where = {a;} we consider G^ bipartite too). The maximality of 
implies that if dV^ = {v e V\V^\{v,u} e E for u e V^}, then Vv G 9Vj it holds 
£.v ^ WxiVu}- It is crucial that the graph is connected. 

Clearly ^ specifies a proper 2-colouring for Gj that uses only the colours ax 
and rjx- In particular, let pi,P2 C be the two parts of G^ and w.l.o.g. assume 
that X belongs to pi . Then, ^ assigns to all the vertices in pi the colour a* and 
to all the vertices in p2 the colour r]x- 

We take T such that for every vertex v G V\V^ to hold (T^)t, = S,v For the 
remaining vertices, i.e. those that belong to Gj, the mapping T swaps the colour 
assignments of the two parts of Gj. More specifically, if ^ specifies for all the 
vertices in each of the parts pi and p2, of the graph G^, colour assignment ax 
and rjx correspondingly, then specifies for all the vertices in each of the parts 
Pi and p2 colour assignments r]x and ax, correspondingly. 

Lemma follows by showing that T is a bijection between Q{G,k,ax) and 
Q{G,k,'nx)- 

Claim A.4.1 For every ^ G Q{G,k,ax) it holds that (T^) G Q{G,k,rjx)- 
^'^ It is dominated by a Bernoulli site percolation. 
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Proof: It is direct that (T^)x = "Hx- It remains to show that is a proper 
colouring of G. 

If is a non proper colouring, then we will have, at least, two adjacent 
vertices (somewhere in G) having the same colour assignment. The swap of 
colour assignments that take place, when we apply T on ^, involves only vertices 
in Vj. Thus if (T^) is a non proper colouring, then the adjacent vertices with 
the same colour assignment can be either all in or some in and the rest in 

It is direct that swapping the colour assignments of the two parts of Gj, 
as these are specified by ^, leads to a proper colouring of Thus, in no 
adjacent vertices in can have the same colour assignment. Also, this swap 
of colourings cannot lead some vertex in to have the same colour assignment 
with some vertex in 9V^. Due to maximality of G{, the colouring ^ does not 
specify colour assignment that uses the colours <Tx and -qx for any vertex in . 
Thus, the swap in the colour assignments that take place when T is applied on 
^ does not lead to a non proper colouring. The claim follows. o 

The next claim shows that every member of f2{G,k,r]x) can be derived by 
applying T to an appropriate member of f2{G, k, ax), i.e. Claim A. 4. 2 shows that 
T is a surjective mapping. 

Claim A.4.2 The range ofT is Q{G,k,r]x)- 

Proof: Let ^' be any member of f^{G,k,r]x)- We are going to show that there 
exists ^ € 17(G, k, ax) such that = 

For the colouring let G^' = {V^',E^i) be the maximal, connected bipartite 
subgraph of G such that x € V^i and Vw e V^' it holds G WxtVx} (in the 
extreme case where V^/ = {x} we consider Gj' a bipartite graph). 

The colouring ^' specifics a proper 2-colouring for G|' that uses only the 
colours ax and rjx- Let pi-p-z C be the two parts of G^i and w.l.o.g. assume 
that ^' assigns to all the vertices in pi the colour r]x and to all the vertices in p2 
the colour ax- 

Consider the colouring ^ which is derived by ^' by swapping the colour as- 
signments of the two parts of Gj' while = for v £ V\V^'. With arguments 
similar to those in the proof of Claim A. 4.1 we can see that ^ G 17(G, k,ax)- The 
claim follows by noting that = o 

In the following claim we show that T is one-to-one. 

Claim A. 4. 3 The mapping T is one-to-one. 

Proof: Assume that there are two colourings G f2{G, k, ax) such that T^^ = 

T^2 _ ^3 -yy-g g^j.g going to show that it should hold = C^. 

Assume the contrary, i.e. ^ ^ . We consider the graphs G^i G^2 and G^3, 
as in the proofs of the two previous claims. By the proofs of two previous claims 
we know that the graphs G^i , G^2 and G^3 are isomorphic. 

We conclude that the colourings and ^ should differ only on the colour 
assignment of the vertices in the graph G^i . We remind the reader that this graph 
is a connected bipartite graph with ^ and ^ specifying proper 2-colourings for 
G^i which both using the colours {ax^rix}- 

By definition, the 2-colouring for G^i that ^ specifies is different than that 
of T^^ The same holds for colouring of and T^. Since T^^ = T^ we should 
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have three different proper 2-colourings for G^i . Noting that there exist only two 
proper 2-colourings for G^i we have a contradiction. The claim follows. o 

Since the mapping T : f2{G, k,ax) — » f2{G, k, r]x) is surjective (Claim A. 4. 2) and 
one-to-one (Claim A. 4. 3), it is a bijection. The lemma is proved. <> 



Lemma A. 4. 2. There exists a coupling of X"^ with X^ such that 

XV = TX" 

where T is the bijection defined in the statement of Lemma A.^.l. 

Proof: The existence of the bijection T implies that |J?(G, k, ax)\ = \^2{Gi, k, rix)\- 
Thus e Q{G, k, a^) it holds that 



\Q{G,k,ax)\ 

This impHes that Pr[X'^ = i\ = PriX"^ = T^], € Q{G,k,(Jx). The lemma 
follows by noting that 

Pr[X'' = ) = 1 and [ ^^I^" = (^^)] ) = ^■ 



Definition A. 4.1. Let G'^{V'^, E'^} denote tlie probability space over all the in- 
duced subgraphs ofG such that = {u £ V\X'^{u) ^ X'^{u) in the coupling of Lemma A. 4-2}. 

The definition of the bijection T implies G"^ is, always, connected and bipartite. 

Theorem A. 4.1. With the above definitions and Pr[E] denote the probability 
of the event E C [k]^ x [k]^ to occur in the coupling defined in Lemma A. 4- 2, 
it holds that 

M-\ax)-i^{-M\\A<Pr[V''nA^9] (29) 
Proof: Theorem follows by the Coupling Lemma (see in [2]). <> 

Wc should remark that Lemma A. 4.1, Lemma A. 4. 2 and Theorem A. 4.1 not 
only do relate the quantity ||/x(-|cr!f) — ^i{-\t]^)\\a with the probability of having 
a sufficiently "large" disagreement, but they give extra information regarding 
its "shape" and its "colouring" as well. More specifically, the vertices in the 
disagreement form a connected bipartite graph and their colour assignments use 
only (Tx and r]x- We close the presentation with a lemma which is going to be very 
useful in the following sections, where we apply the disagreement percolation. 

Lemma A. 4. 3. For every u G y\{a;}, let 7V„ be the set that contains all the 
vertices which are adjacent to the vertex u in G. Also, let S„ C [fc]-'^" x [kY^^ be 
defined such that 

Bu = {ee [kf- X [kf-\Pr[e] ^ 0}. 
If k > A, then it holds that 

^..Pr[X^u)^X^iu)\E]<^^ 

where is the degree of vertex u in G. 
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Proof: Lemma A. 4.1 and Lemma A.4.2 suggest that given the value of X" , the 
random set is specified with probabiUty 1. This impUes that 



max Pr[X%u) ^ X^{u)\E] = ^ max^^ Pr[X^v) ^ X^ {v)\X^V\{v}) = ^y^^^y]. 

We remind the reader that the coupUng of Lemma A.4.2 is such that is 
coloured using only the colours and rjx- 

There are two necessary conditions for some vertex v G to be in V^. 

The first one is that some vertex in Nu should, also, belong to V^. This is due 
to the fact that is connected. The second is the following one: Assume that 
'Wi S Nu and wi e V*. If there exists W2 € Nu\{wi} and X'^{'W2) S {(Tx,i1x}, 
then it should hold X'^{wi) = X'^{w2)- This should hold since Gd is bipartite. 

Considering the two previous conditions the worst case of X''{Ny) is the 
following: At least one vertex in Nu belongs to V^, call this vertex w. No vertex 
in Nu uses the colour {axTrix}\{X'^ {w)] . X"'{Nu) is such that the number of 
different colour that are used is equal to |A^„|. In that case the probability of u 
to belong to is j^J-^ . The lemma follows. o 
Lemma A.4.3 assumes that k > A, otherwise it holds 

max PrlX^iu) 7^ X'>{u)\E] < 1. 



A.4.2 Proof of Theorem 3.3 

By Theorem 1 and Corollary 1.1 in [5], and Lemma A.4.3 we get that 

||a*('|o'x)— A*("l'?a;)IU — path of disagreement between {x} and a vertex in A] 

(30) 

where q is as defined in the statement of the Theorem 3.3. 

We have to remark here that the coupling on which the disagreement perco- 
lation is based, has the following property: Let t be the minimum integer such 
that there is no path of disagreement connecting x to L{x, t). Then, our coupling 
specifies that no vertex in L{x, t'), for t' > t can be disagreeing. This is a crucial 
property of our coupling, since otherwise we could not apply the disagreement 
percolation technique (see [12]). 



A.4.3 Proof of Theorem 3.4 

In Theorem 3.3 it is not taken into consideration that the disagreeing vertices of 
the coupling between X"^ and X"^ should, always, form an induced subgraph of 
G which is bipartite. Plugging in this property of the disagreement we improve 
the bound in (11) by restricting to paths of disagreement whose vertices form 
an induced subgraph of G which is bipartite. 

For the definition of the set BP{x, A) and the variables Tm the reader should 
see in Section 3.2 (before the statement of Theorem 3.4). 

Proof of Theorem 3.4: Consider the vertices of the path M G BV{x, A) and 
let {x,xi, . . . ,xi) be the order in which we see the vertices in M as we traverse 
it from {x} to A. Let 

Pxj = Pr[xj is disagreeing! xt is disagreeing <t < j]. 
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Clearly it holds that Pr[lM = 1] = Px, ■ By Lemma A.4.3 it holds that 
< thus 

By Theorem A. 4.1 we have that 

- /"(•ka;)|U < Pr [y^M(^BV(x,A)'^M\ ■ 

Theorem follows by applying the union bound. <> 
A. 5 Proofs 

A.5.1 Proof of Theorem 3.1 

Let 

ern = \Pr[Xi{vi) ^ Xi{ui)] - Pr[Xi,o{vi) Xi^o{ui)]\ for < i < r - 1. 

logZ(G,fc) = E[=o log(-P[^-o(«*) 7^ X,^oM]) + logZ{Go,k) 

< ElJo log iP[X^{v^) + + err,) + logZ(Go, A:) 

< EI=o log {P{Mv,) + + EI=o log (} + tto^toh) + ^) 

< log Z{G, k) + ElJo' log (l + pix.kI^xu^Oi ) 



<logZ(G,fc) + Elro^ppf7(^;pxlK 



)] 

The final derivation follows by the the fact that log(x) is an increasing function 
(since the base is of the logarithm is e > 1) and by the inequality I + x < , 
for any x. Similarly we get the lower bound for log(Z(G, k)). Theorem follows. 

A.5.2 Proof of Theorem 3.2 

Theorem 3.2 follows as a corollary of two following lemmas. 
Lemma A.5.1. It holds that 

n-i 

\Pr[Xi{vi) Xi{ui)\ - Pr[Xi,oK) 7^ X,^^{ui)\\ < ^ \\nij{-) - 

3=0 

Proof: Clearly the Gibbs distributions Hi and iJ^i^n are the same. 

\Pr[Xi{vi) ^ Xi{ui)]-Pr[Xifl{vi) ^ Xifi{ui)]\ < max |rt,o(A)-/ii(^)| < ||/Xi,o(-)-Mi,n(-)lki 

AC[fc]*i 



By the triangle inequality we get that ||/Xi,o-Mi,ri||a'i < EjLo^ 
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Lemma A. 5. 2. Let A be any subset of vertices of Gij that does not contain Vi 
and Ui . It holds that 

IIWJ -Mi,j+l|U < C'iJ ■ 1 2™o/^ ^JI/^i,j(-kx)-Mij(-|o-x)IU (31) 

where Qj = Cij{Gij,k) = maxs^t^[k] {{Pr[Xi,j{uij) = s\Xij{vij) = t])"^}. 
Proof: With relatively simple argument we can get that 

where r G Q{Gij+i, k) is taken so as t^. ,^ to maximize the r.h.s. in the above 
inequality. By the fact that i?(Gij+i,fc) C Q{Gij,k) and by the conditional 
independence, it holds that Mi,j+i(-|Tij/j,j) = IJ'i,j{'\T^ij)j thus 

By definition (see (3)), there exists a set ^ C [k]^ such that ^ij{A) > 0^^ and 

It holds that 
which implies that 

<-^^¥^-|k,i(-|^)-W,,(-)lk, (32) 

<r¥^- E iK.(-l^)-M.i(-)llx 

The inequality in the second line holds since 

The inequality in the third line holds by the fact that 

\\f^i,j{-\-^) - w,j(-)lk*j < E ll^».j(-l-^) - w,j(-)llx- 

(See Section A. 5. 3 Proposition A. 5.1 for a proof of the above inequality, which is 
based on coupling.) For every x G tf^j, there exists ^ C [fc]^ such that iiij{A) > 
and 

- Mi,j(-)llx = ll^ijiM^) - t^i,j{A)\. 

It holds that 

IJ-ijjAlA) ^ iJ-i,j{A\A) 
Mi,j(^) ft,j(-A) 



It is direct that if /J-ijiA) = 0, then ^^.^(-^k^i.j) = 0. 
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which imphes that 
It is direct that 

\tii,j{A\A) - tJ.ij{A)\ < max \\^li,j{■\cTl) - ^li,j{■\al)\\A. 
Thus, we get that 

1/k 



llMij(0-Mi,j(-|T*-i,JIU < — -7- 7- V ^ max WfJ^iji-WD-mji-lal) 

After some trivial derivations the lemma follows. 



A. 5. 3 Properties of Total Vciriation distance 

Proposition A. 5.1. For any a G Q{Gij,k), and any ACVit holds that 

Proof: The convenience wc abbreviate Gi_j, l-M.j{-) and to G /i and t?", cor- 
respondingly. The proof is going to be made by coupling. Consider two systems 
Si = PCS{G, k, a a) and ^2 = PCS{G, k). Let v, u be the two vertices that be- 
long to 'I'. Also, let (X^,, Xu), {Y^, Yu) G [fc] x [k] be two pairs of random variables 
which are equal to the colour assignments of the pair of vertices (ii, u) in the 
systems S\ and ^2, correspondingly. 

Let Popt[{Xy,Xu) ^ (Ku,Ku)] be the probability to have disagreement in 
the optimal coupling of {Xy,Xy) with {Yy,Yy)- Let also P^p^[Xy ^ Yy] be the 
probability to have disagreement in the optimal coupling of Xy with Yy. Let 
P^p^ [Xu 7^ Yy] be the probability to have disagreement in the optimal coupling 
of Xu with Yu- 

The coupling lemma (in [2]) suggests that it suffices to show that 

P,pt[{Xy,Xy) ^ {Yy,Yy)] < P^'^JX^ ^ Yy] + P" JX„ ^ Yy] . 

We are going to give an optimal coupling of {Xy,Xy) with (F„,y„) which, 
at the same time, couples optimally Xy with Yy and X„ with Yy. Let P[E] be 
the probability of the event E to occur in the coupling wc give, where E G [fc]^. 

The coupling consists in partitioning [0, 1] into, at most, fc^ subintcrvals. Each 
of these subintcrvals will correspond to a value of the 4-tuple {Xy,Xy,Yy,Yy). 
The length of the interval that corresponds to [Xy^ Xy,Yy,Yy) = (si, .S2, -^3, .''4) 
will be equal to P[{Xy,Xy,Yy,Yy) — (51,52,53,54)]. Also, we consider the ran- 
dom variable U that is distributed uniformly in the unit interval [0, 1] . Depending 
on which subinterval of [0, 1] the r.v. U belongs, we assume the corresponding 
value for the four random variables. 

Note that the length of the subintcrvals should be taken such that 

P[Xy = s]= Pr[Xy = s] Vs G [A;]. 
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The analogous should hold for the marginal distributions of X„, Yy and 1^. 

We expect to exist subintervals of [0, 1] of four kinds. The first kind corre- 
sponds to those subintervals for which it holds Xy = Yy and Xy = Yy. The 
second one corresponds to those that Xy =Yy and X„ ^ Yy. The third corre- 
sponds to those that Xy ^ Yy and Xy = Yy. And the last one, corresponds to 
those that it holds Xy ^Yy and X„ ^ Yy. 

Consider, first, the subintervals of [0, 1], named ls,t, for s,t & [k]. The length 
of the interval Zs,t is such that 

=min{Pr[(X„,X„) = {s,t)], Pr[{Yy,Yy) = {s,t)]} . 

The probability for U to belong to any of these intervals is equal to the sum of 
their lengths, i.e. 

P[{Xy,Xy) = {Yy,Yy)]= ^ 

s,te[fc] 

These are the subintervals of the first kind. Furthermore, we can see that there 
cannot be any coupling that achieves smaller probability than \ls^t\ for 

the event ''{Xy, Xy) = {Yy,Yuf . Thus, we get that 

Popt[{Xv,Xy) ^ {Yy,Yy)] = P[{Xy,Xy) ^ {Yy,Yy)]. (33) 

Until know we have shown that our coupling is optimal w.r.t. to coupling the 
pair {Xy,Xy) with {Yy,Yy). 

Furthermore, if for s G [k] it holds Pr[Xy = s\,Pr\Yy = s] > '}2,te[k] \^s.t\, 
then we can have a subinterval of [0, 1] which corresponds to the event, "X^ = 
Yy = s, Xy ^ Yy \ Lct I g bc the name of this subinterval. It is direct that it 
holds 

|^,,*| = uiin{Pr[Xy = s],Pr[X„ = s]} - ^ 

te[fe] 

Also, we can see that in our coupling it holds that 

P[Xy = Yy]=Y, min {Pr[X„ = s\,Pr[Xy = s]} . 
selk] 

Furthermore, we can see that there cannot be any coupling that achieves greater 
probability than the r.h.s. of the above equality for the event ''Xy = Yy" . This 
implies that 

p„;jx„ ^ K] = ^ n] (34) 

Similarly, if for t G [k] it holds Pr[Xy = t],Pr[Yy = t] > Es6[fe] I's.*!' ^^^^ 
we can have a subinterval of [0, 1] which correspond to the event, "Xy = Yy = 
t,Xy ^ Yy" . Let U^t be this subinterval. It direct that it holds 

\h,t\ = ram{Pr[Xy=t],Pr[Xy=t]}- ^ \h,t\. 

se[k] 

Also, we can see that in our coupling it holds that 

P[Xy = Yy] = Y^ min {Pr[Xy = t],Pr[Xy = t]} . 
te[k] 
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Furthermore, we can see that there cannot be any coupUng that achieves greater 
probability than the r.h.s. of the above equahty for the event "X„ = Y^". This 
implies that 

p„"pjx„ ^ y„] = ^ y„] (35) 

The proposition follows by (33), (34) and (35) and, also, by noting that 



A. 5.4 Proof of Lemma 2.1 

Consider the sequence of subgraphs Q{G) = Go, ... , Gr, where r = \E\ and Go 
is empty. Consider, also, the following telescopic relation 

Lemma will follow by showing that 

|^(G,+i,fc)| 



\n{Gi,k)\ 



The above relation clearly holds by noting the following: The set of fc-colourings 
of Gj+i is the same as the subset of fc-colourings of Gj that contains all the 
colourings that assign Vi and different colours. Lemma follows. 



A. 5. 5 Proof of Lemma 3.1. 



= 2^^^^^) X! Im(o-a|(Tx) -m(o-a)| 
= 2 Z_/ M(o-A)|/i(o-a;kA) -M(f^x)| 

-2 X M(o-a) ^ |M(T-a;|cr.l) -M(Tr.)l 
<fc ^ /i(c7yl)||/i(-|£7yl) -/i(-)||^. 

Noting that it holds 

ll/^(-kx) - m(-|tx)IU < llM(-kx) - /"(OIU + IIm(-) - m(-|7"x)IU 
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we get the second relation of the lemma. For the first inequality we have the 
following: 



= 1- J2 E K^.)-Hm)-(^{A)\ 



2 



2 

E '^(^-) • I f 



= E MU-IK-IU-m(-)IU 

< E max ||/i(-kx) -m(-|tx)|U 

= max^^,T-,e[fc]x \\fJ,{-\a^) - fJ-{-\T^)\\A 
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